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Data Handling System 

This invention relates to a data handling system, and in particular a device for 
organising and storing data for subsequent retrieval. 
5 The advent of low cost digital cameras, cheap storage space and the vast 

quantity of media available has transformed the personal computer into a multi-purpose 
home entertainment centre. The availability of such a quantity of media dictates that the 
user should be able to find materia! quickly, whether stored by the user himself or 
available over a communications system. One of the most versatile communications 
10 networks is the "Internet", which provides worldwide transmission of recorded digital 
media files representing, for example, text, sound, pictures, moving images, numerical 
data, or software. 

Note that although the plural 'media* is used throughout this specification, the 
term 'media file 1 is to be understood to include files which are intended to be conveyed to 

15 a user by only one medium - e.g. text or speech - as well as 'multimedia' files which 
convey information using a plurality of such media. 

The number of media files accessible via the Internet is very large, so there is a 
need to label media files with some description of what they contain in order to allow 
efficient searching or cataloguing of media. It is therefore desirable to add metadata to the 

20 individual objects in the media. Metadata are data that relate to the content or context of 
the object and allow the data to be sorted - for example it is common for digital data of 
many kinds to have their date of creation, or their amendment history, recorded in a form 
that can be retrieved. Thus, for example, HTML (HyperText Mark-up Language) files 
relating to an Internet web page may contain 'meta' tags that include keywords indicating 

25 what subjects are covered in the web-page presented to the user. Alternatively, the 
keywords may be attached to a separate metadata object, which contains a reference to 
an address allowing access to the item of data itself. The metadata object may be stored 
locally, or it may be accessible over a long-distance communications medium, such as the 
Internet. Such addresses will be referred to herein as "media objects", to distinguish them 

30 from the actual media files to be found at the addresses indicated by the media objects. 
The expression 'media object' includes media data files, streams, or a set of pointers into 
a file or database. 

The structure of an individual media object consists of a number of metadata 
elements, which represent the various categories under which the object, (or more 
35 properly the information contained in the media file to which the object relates) may be 
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classified. For example a series of video clips may have metadata elements relating to 
"actors", "locations" "date of creation", and timing information such as "plot development" 
or "playback order", etc. For each element, any given media object may be allocated one 
or more metadata values, or classification terms, from a vocabulary of such terms. The 
5 vocabulary will, of course, vary from one element to another. 

The metadata elements and their vocabularies are selected by the user according to 
what terms he would find useful for the particular task in hand: for example the values in 
the vocabulary for the metadata element relating to "actors" may be "Tom", "Dick", and 
"Harriet", those for "location" might include "interior of Tom's house", "Vienna street 

10 scene", and "beach", whilst those for "plot development" might include "Prologue", 
"Exposition", "Development", "Climax", "Denouement", and "Epilogue". Note that some 
metadata elements may take multiple values, for example two or more actors may appear 
in the same video clip. Others, such as location, may be mutually exclusive. 

A user typically stores video and audio files and digital pictures in a hierarchical 

1 5 directory structure, classifying the media for subsequent retrieval. This is far from ideal as 
it is often impossible to decide how much detail is enough to organise a media library. In 
particular it often requires more technical skill than the user may have, in particular in a 
home context. In a business context the skills may be available, but the database may 
need to be accessed by several different people with different needs, not all of whom 

20 would have the necessary skills to generate suitable classification terms. The meticulous 
compilation of metadata is often a tedious process of trial and error, and requires the 
expenditure of substantial human resources. Moreover, the person performing the 
classification is unlikely to add metadata beyond what is sufficient to achieve the results 
he requires. Consequently, the results may be of little use to subsequent users if the 

25 database is reused. In particular, it is difficult to ascertain whether all media objects have 
been allocated all the metadata that might be appropriate to them, nor how useful an 
individual metadata category may be in identifying useful material. Some metadata may 
apply to a large number of items, in which case additional metadata, detailing variations 
between the items identified thereby, may prove useful. Conversely, a category allocated 

30 to very few items, or none at all, may be indicative of an area in which more media items 
should be obtained, or that for some reason the metadata have not been applied to items 
to which it would have been appropriate. Such considerations are difficult to address with 
existing systems. Data Clustering and Data mining algorithms such as the Minimal 
Spanning tree algorithm and K- means algorithm have conventionally been used to 
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analyse databases and to attempt to fill out missing data. These algorithms are slow and 
usually run offline. 

International patent application WO 02/057959 (Adobe) describes an apparatus 
to visually query a database and provide marking-up codes for its content. However, it 
5 does not let the user visualise how many marking-up codes are already present in the 
individual items making up the database. The interface described therein is not capable of 
handling complicated metadata schemas, as it would require too many 'tags', and this 
would lead to an extremely complicated interface that would not easily indicate which 
objects still have to be marked up. As in most applications that require metadata input, the 
10 main interface to adding metadata is text-based. Menus with pre-defined vocabularies 
can be called up to provide marking-up codes to objects or content. 

The present invention seeks to simplify the metadata marking-up process. 
According to the present invention, there is provided: a data handling device for 
organising and storing media objects for subsequent retrieval, comprising a display for 
15 displaying representations of the media objects, a user input device for selecting an 
individual media object, and data storage means for allocating the selected media object 
to a designated category selected using the user input device, wherein a representation of 
the selected . media object is placed in a region of the display representing the 
designated category. 

20 According to another aspect, the invention provides a method of organising and 

storing media objects for subsequent retrieval, in which representations of the media 
objects are displayed, a user selects an individual media object from the display, and 
allocates the selected media object to a designated category by placing a representation 
of the selected media object in a region of the display representing the designated 
25 category, whereby data relating to the selected media object is stored associating it with 
the category so identified. 

The invention provides an interface that can be used to visually add metadata to 
a database or collection of media objects. By placing representations of the elements in 
display areas representing the individual categories, a number of advantages are 
30 achieved. In particular, the number of items to which each item of metadata has been 
applied can be readily recognised, as all such items are collected in one region of the 
display. In addition, the size of the display area can be made proportional to the number of 
items it contains. This makes clusters of data easy for a user to identify and sort. A visual 
interface allows the differentiation or segmentation of existing groups, and the filling of 
35 missing metadata elements. It allows the user to sort the items into categories, which is a 
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more natural process than applying the classification terms that relate to such categories 
to individual items. 

The metadata marking-up process is preferably carried out by moving icons 
representing media objects between display areas that represent pre-defined vocabulary 
5 values for a given metadata element. A representation of the metadata is built up in terms 

"of sets similar to Venn diagrams. However, Venn diagrams representing more than a very 

few different sets become very complex. Figure 2 shows a Venn diagram for six sets. The 
regions 1,2,3,4,5,6 each contain only one of the six sets, whilst the region 123456 is the 
union of all six sets. It can be seen that, even for only six sets, the diagram is extremely 

10 complex and difficult to follow - there are 2 6 = 64 separate areas, representing the 
different possible combinations of the six sets. The invention uses a much simpler 
representation, but offers various functions allowing ready manipulation of the data 
represented. This representation offers visual hints to the user as to how far the marking- 
up has progressed, and which metadata elements are missing. 

15 In a preferred embodiment, the user may select a plurality of catagories, giving 

ready visualisation of the metadata. He may take a multi-dimensional view of the search 
space, requesting several data elements, to any of which one or more of a plurality of 
values have been applied. (Effectively the "union" of the sets of objects having those 
values). Alternatively, he may take an "intersection" view, searching only for objects which 

20 have each had all of a plurality of specified values applied to them. Where a large number 
of such intersections are possible to be defined, the user may be allowed to control the 
maximum number of categories to be displayed. The size of the display area allocated to 
each category may be made proportional to the number of media objects portrayed 
therein. 

25 In a preferred embodiment icons representing the media objects are capable of 

being moved between regions of the display area representing categories having pre- 
defined vocabulary values. To change the values associated with a media object, its 
representative icon may be removed from one display area when added to another. If the 
values are not mutually exclusive, it may instead remain in the first display area, with a 

30 copy placed in the additional area. 

The invention may be configured to allow a user to generate additional categories 
having new vocabulary values, such that the media objects may be further categorised. 

Means may be provided for indicating the number of categories associated with 
one or more media objects, and in particular to identify media objects to which no 

35 categories have been applied. 
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Means may be provided for selecting a subset of the media objects for allocating to 
a given category. 

The invention also extends to a computer program or suite of computer programs 
for use with one or more computers to provide an apparatus, or to perform the methods, 

5 of the invention as set out above. 

A useful feature of the invention is that it guides the user to complete the 
minimum necessary marking-up to accomplish his task, and to provide a more even 
distribution of media objects. It does this by providing visual cues to help complete 
missing marking-up information and also provide a visual representation of the existing 
10 level of marking-up or completeness of a database. In particular, it may provide an 
indication of media objects for which no marking-up information has yet been applied for a 
particular element. In the described embodiment this takes the form of a display area in 
which "unclassified" objects are to be found. 

The invention allows the visualisation of the metadata allocated to a given media 
15 object in the context of the complete database of media objects. It also helps to generate 
and modify existing vocabularies. It lets the user create, modify or delete vocabularies 
during the markup process - for example by adding a new vocabulary value to a cluster, 
separating a cluster by adding several new vocabulary values, or changing the name of a 
vocabulary value after observing the objects in that cluster. 
20 Some classification tasks could be accomplished automatically by running 

clustering algorithms on the database, but these processes consume time and resources. 
Providing the user with an effective interface lets him carry out these tasks much more 
quickly. The invention provides a more natural interface than the text-based prior art 
systems. It allows the user to sort the objects into categories (values) and creates and 
25 adjusts the metadata for each object based on how the user does this. 

In a large database, there are likely to be too many individual elements in the 
metadata structure, or values in their vocabularies, to allow all of them to be displayed 
simultaneously. In the preferred arrangement an hierarchical structure is used to allow 
the user to select objects having specified elements and values, thus making the interface 
30 less cluttered by letting the user view only those metadata objects having elements in 
which he is currently interested. He may then sort them using different search terms 
(elements). 

As will be understood by those skilled in the art, the invention may be 
implemented in software, any or all of which may be contained on various transmission 
35 and/or storage mediums such as a floppy disc, CD-ROM, or magnetic tape so that the 
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program can be loaded onto one or more general purpose computers or could be 
downloaded over a computer network using a suitable transmission medium. The 
computer program product used to implement the invention may be embodied on any 
suitable carrier readable by a suitable computer input device, such as CD-ROM, optically 
5 readable marks, magnetic media, punched card or tape, or on an electromagnetic or 
optical signal. 

An embodiment of the invention will now be further described, by way of example 
only, with reference to the drawings, in which: 

Figure 1 is a schematic diagram of a typical architecture for a computer on which 
10 software implementing the invention can be run, 

Figure 2 is a representation of a six-set Venn Diagram 

Figure 3 is a flow diagram showing a simplified view of the processes performed 
by the invention. 

Figure 4 is a representation of a screen shot generated during a single dimensional 
15 viewing process 

Figure 5 is a representation of a screen shot generated during a more complex 
multi-dimensional viewing process 

Figure 6 is a representation of a screen shot generated during a viewing process 
including intersections of metadata sets 

20 

Figure 1 shows an architecture for a computer suitable for running software 
implementing the invention. The computer comprises a central processing unit (CPU) 10 
for executing computer programs, and managing and controlling the operation of the 
computer. The CPU 10 is connected to a number of devices via a bus 11. These devices 

25 include a first storage device 12, for example a hard disk drive for storing system and 
application software, a second storage device 1 3 such as a floppy disk drive or CD/DVD 
drive for reading data from and/or writing data to a removable storage medium and 
memory devices including ROM 14 and RAM 15. The computer further includes a 
network card 16 for interfacing to a network. The computer can also include user 

30 input/output devices such as a mouse 17 and keyboard 18 connected to the bus 11 via an 
input/output port 19, as well as a display 20. The person skilled in the art will understand 
that the architecture described herein is not limiting, but is merely an example of a typical 
computer architecture. It will be further understood that the described computer has all 
the necessary operating system and application software to enable it to fulfil its purpose. 
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The embodiment provides three main facilities for handling a media object. In this 
example it is a software object that contains a reference to a video/audio clip and contains 
metadata about it. 

The first of these facilities is a simple visual interface, which lets the user add 
5 metadata by moving icons or "thumbnail" images representing media objects between two 
areas using a control device - a process known as "drag and drop". This is shown in 
Figure 4. Figure 4 also illustrates the selection of sorting criteria from a hierarchical menu 
list 30. This allows a user to quickly switch between different dimensions or layers, or 
between different metadata elements. In this example the metadata element 
10 "classification" (represented by the element 40 in the hierarchical display 30) has been 
selected, and a number of values from the vocabulary for that element are displayed - 
"Advertising" 401, "affairs" 402, "documentary" 403, etc. These define sets that are made 
up of media objects having these respective values for the "classification" metadata 
element. 

15 Media objects to which no value has yet been applied for this specified metadata 

element are displayed in a separate field 400. This second facility allows unmarked 
media objects to be identified so that the user can perform the marking-up operation. 

The third facility, illustrated in Figures 5 and 6, allows a set of media objects that 
have one or more common elements to be identified, and allows the user to separate or 

20 differentiate the media objects by adding new or different metadata, or to find some other 
criteria that achieve a further distinction between the media objects. 

The query process, which is common to all the variants, will now be described 
with reference to Figure 4. 

The process can be started in either of two modes - either directly acting on the 

25 entire database, or from the template view. The latter approach allows mark-up of a single 
cluster in the database, and segmentation of only the objects in that cluster. It also 
provides a less cluttered view for the user. In doing this it prevents unnecessary metadata 
from being added. Viewing this collection of objects within the visual marking-up tools lets 
the user easily visualise any metadata that might differentiate them, and if there is 

30 insufficient differentiation it allows the user to modify existing metadata to represent the 
object more accurately. 

A metadata element, or combination of such elements, is selected by selecting 
one or more categories 32 in the hierarchical structure 30 (step 70 - Figure 3). A control 
element of the user interface, containing a textual or graphical representation of the 

35 selections available, may be populated by appending an additional query to the template 
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view described in the previous paragraph, identifying the media objects to which marking- 
up is to be applied. By populating the control elements in this manner it is possible to 
visualise the metadata marking-up in the entire database, or a defined subset of it, using 
any of the views described below. The media objects may be represented as either 
5 miniature images, as illustrated in the Figures ("Thumbnail" view) or as a plain text 
filename (Text-only view). 

To add annotation to a particular element in the media object data model, it is first 
necessary to select the desired element in the hierarchical menu structure. As only a 
single class has been selected in this example, the extra steps 72-75 are omitted (these 

10 will be discussed later) and a set of List controls or boxes 400 - 41 1 is generated for each 
value in the vocabulary of available terms stored for each respective element (step 71). 
Existing metadata can be visualised by the manner in which media objects are arranged 
in the different boxes. Media objects that are in the 'unclassified' box 400 do not contain 
any metadata values for the particular element selected in the hierarchical menu structure. 

15 If the metadata values are not mutually exclusive, media objects may appear in more than 
one box. 

The process requires a number of additional steps 72-75 if more than one 
category has been selected, as will be discussed later with reference to Figures 5 and 6. 
However, if only a single category has been selected (step 701), a single-category view is 
20 generated (step 71). In the example shown in Figure 4 the "Classification" element 41 has 
been selected under the "Creation" heading 40, and a number of metadata classes - 
"advertising" 401, "affairs" 402, "documentary" 403, "drama" 404, "Education" 405, "Film" 
406, etc are displayed as windows, each containing those media objects to which that 
category has been applied. Each individual media object 101, 102, etc is represented in 
25 the appropriate window by "thumbnail" images or other suitable means. 

The size of the display area representing each category is determined according 
to the number of media objects to be displayed. Large groups, such as those shown at 
401, 405, may be partially represented, with means 499 for scrolling through them. Thus a 
view of the various media objects 101, 102, 103, 104, 105, 106, 107 etc, is generated, 
30 sorted according to the various "classification" categories (metadata values) 401 , 402, 403 
etc including an "unclassified" value 400. The view of existing metadata is similar to a 
Venn diagram with non-overlapping sets. (If the sets were allowed to overlap - that is to 
say, one media object can have two or more metadata values applied to it - identical 
copies of the same object may appear in several of the boxes). All items are originally 
35 located in the "unclassified" category 400. The process of adding metadata to media 
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objects relies on 'dragging and dropping* objects in and out of classification boxes (step 
76), each box being representative of a specific metadata value. The visual representation 
of the contents of each category makes it straightforward to ensure that similar media 
objects are placed in the same category (allocated the same value). A user can also 
5 identify which categories are most heavily populated, and therefore worthy of subdivision. 
This would allow the distribution of the metadata in a database to be "flattened" i.e. all the 
media objects should have similar amount of metadata associated with them. 

The list controls are populated by inserting the metadata associated with the 
media object. To do this the user 'sorts' the media objects using a "drag and drop" 

10 function (76), for example using the left button of a computer "mouse". For the particular 
media object that is moved by that operation, the metadata value originally stored for it, 
which is represented by the box in which it was originally located, is replaced by the value 
represented by the box to which it is moved (step 77). Moving from the "unclassified" area 
adds a value where none was previously recorded. If it is desired to delete a value, the 

15 object is moved from the box representing that value to the "unclassified" box. 

In a single view, if the metadata element is extensible - that is to say, it can have 
multiple values - such as "actors", moving to "unclassified" only removes one actor (the 
one it was moved from) - the others are unchanged. The icon that was moved to 
"unclassified" would be deleted if other values for the element still exist for that object. 

20 Deletion operates in a different way when multi-dimensional views are in use, as will be 
discussed later. 

If a value is to be added, rather than replace an existing one, a "right click" drag- 
and-drop operation would generate a copy (step 78) - in other words for that particular 
media object the metadata value represented by the box that it is "dropped" in is copied to 
25 that particular element but not deleted from the original element. 

In this case a check is made (step 761) to ensure that the "copy" operation is 
valid: in other words to check that the origin and destination metadata elements are not 
mutually exclusive. If the multi dimensional view is being used, as will be discussed with 
reference to Figure 5, a check is also made as to whether the proposed move is between 
30 boxes that represent different metadata elements, and not just different values in the 
vocabulary of one metadata element. An error message (79) is generated if such an 
attempt is made. Attempting to copy to or from the "unclassified" area would also 
generate an error message. 

The processor 10 automatically populates the list boxes 401, 402, 403, etc by 
35 querying the database 15 for media objects that contain the metadata values that are 
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represented by the boxes to be displayed. The unclassified box is populated by running a 
NOT query for all the metadata values. 

In the single-dimensional view shown in Figure 4, the user selects a single 
metadata element 41 from the hierarchical menu structure 30 (step 70). In the illustrated 
5 example this is the "classification" element 41. The media objects are now sorted 
according to the vocabulary values of their respective metadata elements (step 71) - for 
example, "advertising" 401, "affairs" 402, "documentary" 403, "Drama" 404, "Education" 
405, "Film" 406 etc. An empty box may appear - this denotes a metadata value defined in 
the database vocabulary but not used by any of the objects. An unclassified box 400 
1 0 contains media objects (e.g. 1 00) that do not contain any values for the selected metadata 
element. 

As shown in Figures 5 and 6, more than one metadata element may be selected 
by identifying more than one checkbox (step 701). The user may select whether to display 
a multi dimensional view (depicted in Figure 5, step 72) or an "Intersection" view (steps 
15 73-75, depicted in Figure 6). In the multidimensional view shown in Figure 5 the metadata 
elements Action 51, FOV (field of view) 52 and Pace 53 have been selected. This 
produces three sets of list controls: one for each of the metadata elements. These three 
sets may be denoted in different colours to indicate boxes or values that belong to the 
same vocabulary element in the multi dimension view, to improve clarity for the user. This 
20 view provides the user a simple way of visualising the metadata values in different 
metadata elements of the media objects. 

As in the single dimensional view (Figure 4), the list controls 511-513, 521-525, 
531-532, in each set represent metadata values for a given metadata element 51, 52, 53. 
An individual metadata element may appear in more than one set, if it has a value for 
25 each element represented. The 'Unclassified' box 500 contains the media objects that do 
not contain any of the metadata values that are represented by all the list controls. 

In this view, copying (Right click drag and drop) between different list control 
groups 51, 52 (Different colours) is always allowed (steps 76, 761, 78). This is because 
this copying is merely adding a new value to a different metadata element: if there is 
30 already a value in that given metadata element it will be replaced. Copying between list 
controls 511, 512 in the same metadata element set 51 (Same colour) is only possible if 
that particular metadata element is of a type that allows multiple values. In the multi 
selection view, moving an object to the "unclassified" area would only delete the value of 
the metadata element from which it was moved. For example, an object 107 appears in 
35 both the "fight" box 512 (Action element 51) and the "fast" box 532 (Pace element 53). If it 
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is moved from "fast" 532 to "unclassified" 500, only the value in the "pace" metadata 
element 53 would be deleted - the "Action" element 512 would remain unchanged 

An "Intersection" View (step 73), as shown in Figure 6, may instead be selected 
by the user (step 702). This view is similar to the multi-dimensional view described with 
5 reference to Figure 5. Again the user selects multiple values from the hierarchical menu 
structure using check boxes, in the example shown in Figure 6. Here the "Actors" 
metadata element 61 and the "Place" metadata element 62 have been selected. (The 
latter is a subdivision of the "location" element 63 in the hierarchical menu structure). The 
user has selected three list controls as the maximum to display. Hence three 

10 combinations of Actor and Location values (Actor=John, Peter, Sarah: Location=Beach) 
are generated (step75). Alternatively if the user sets the number of list controls to a higher 
value, other list controls would be generated (Individual list controls for each actor and 
location). In the "intersection" view, if an object is moved from an intersection box to 
"unclassified", all the values represented in that intersection would be deleted. For 

15 example, by moving an object 106 from the "Peter/beach" box in Figure 6, the value 
"Peter" would be removed from the "actor" element, and the value in the "location" 
element ("beach") would also be deleted. 

In this view, the number of possible list controls is the product of the number of 
terms ni n 2 n 3 in each category (if mutually exclusive), or 2 N where N is the sum 

20 of the number of terms in each category that may be applied (if not mutually exclusive) ^ 

+ n 2 + n 3 + This may be a much larger number of categories than can 

conveniently be accommodated on a conventional display device, so the user is given the 
facility to limit the number of list controls represented (step 74), by using a slider 64 or 
some other selection means. The number of metadata elements the user can 

25 simultaneously choose is also restricted to three. This is to prevent excessive processing 
time and system running out of resources. 

By identifying the boxes that contain the most objects, and which ones have the 
least or even none, the user can decide which vocabulary terms are most relevant to a 
database and get a much better idea of how much more marking-up he needs to do. 

30 Moreover, the multi-dimensional view (Figure 5) and intersection view (Figure 6) afford 
visual hints to the user to fill in missing metadata values.. The user can graphically see 
which metadata element or values produce which cluster. So by using this in combination 
with a template the user can easily generate the movie clips he desires by adding markup 
just to the clips he is interested in. The user can see the markup of an object relative to all 
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the others in the database, and so he is given an idea of how much more markup is 
needed to add it into a cluster or differentiate it when it is in a cluster. 

The invention may be used to compile a media article such as a television 
programme using a wide variety of media, such as text, voice, other sound, graphical 
5 information, still pictures and moving images. In many applications it would be desirable to 
personalise the media experience for the user, to generate a "bespoke" media article. 

Digital sets of stored media objects can be stored in a variety of formats, for 
example, a file may merely contain data on the position of components seen by a user 
when playing a computer game - the data in that file subsequently being processed by 
1 0 rendering software to generate an image for display to the user. 

A particular use of such metadata is described in International Patent application 
PCT/GB2003/003976, filed on 15 th September 2003, which is directed to a method of 
automatically composing a media article comprising: 

analysing digital metadata associated with a first set of stored media objects, 
15 which digital metadata includes: 

related set identity data identifying a second set of stored media objects; 
and relationship data which indicates the relationship between what is 
represented by the first set of stored media objects and what is represented by the second 
set of stored media objects; 

20 and arranging said first and second sets of stored media objects in a media 

article in accordance with said analysis. 

This uses detailed formal and temporal metadata of the kind already described, 

(e.g. identifying individual actors or locations appearing in a video item, and time and 

sequence related information). A set of filters and combiners are used to construct a 
25 narrative by arranging media objects in a desired sequence. The present invention may 

be used to apply metadata records for each media object available for use in such a 

compilation. 
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CLAIMS 

1 . A data handling device for organising and storing media objects for subsequent 
retrieval, comprising a display for displaying representations of the media objects, a user 
5 input device for selecting an individual media object, and data storage means for 
allocating the selected media object to a designated category selected using the user 
input device, wherein a representation of the selected media object is placed in a region of 
the display representing the designated category. 

10 2. A device according to claim 1, in which icons representing the media objects are 
capable of being moved between regions of the display area representing categories 
having pre-defined vocabulary values. 

3. A device according to claim 1 or claim 2, configured to allow a user to generate 
15 additional categories having new vocabulary values, such that the media objects may be 

further categorised. 

4. A device according to any preceding claim, configured to allow a user to select a 
plurality of categories 

20 

5. A device according to claim 4, configured to provide a view of data items, to any of 
which one or more of a plurality of categories have been applied. 

6. A device according to claim 4 or claim 5, configured to provide a view of data 
25 items, to each of which a plurality of categories have been applied. 

7. A device according to claim 6, wherein means are provided to provide user control 
of the maximum number of categories to be displayed. 

30 8. A device according to any preceding claim wherein a selected element may be 
removed from one display area when added to another. 

9. A device according to any preceding claim wherein an element selected from one 
display area may remain there, and a copy placed in the additional area. 

35 



30445.doc 



14 

10. A device according to any preceding claim, providing means for indicating the 
number of categories associated with one or more media objects 

11. A device according to claim 10, providing means for identifying media objects to 
5 which no categories have been applied 

12. A device according to any preceding claim, providing means for selecting a subset 
of the media objects for allocating to a given category. 

10 13. A device according to any preceding claim, providing means for making the size of 
the display area allocated to each category proportional to the number of media objects 
portrayed therein. 

14 A computer program or suite of computer programs for use with one or more 
15 computers to provide any of the apparatus as set out in any one of claims 1 to 13. 

15. A method of organising and storing media objects for subsequent retrieval, in 
which representations of the media objects are displayed, a user selects an individual 
media object from the display, and allocates the selected media object to a designated 

20 category by placing a representation of the selected media object in a region of the 
display representing the designated category, whereby data relating to the selected media 
object is stored associating it with the category so identified. 

16. A method according to claim 15, in which icons representing the media objects are 
25 capable of being moved between regions of the display area representing categories 

having pre-defined vocabulary values. 

17. A method according to claim 16, in which a user may generate additional 
categories having new vocabulary values, such that the media objects may be further 

30 categorised. 

18. A method according to claim 15, 16 or 17, wherein a user may select a plurality of 
categories 
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19. A method according to claim 18, wherein a view is provided of data items to any of 
which one or more of a plurality of categories have been applied. 

20. A method according to claim 18 or claim 19, wherein a view is provided of data 
5 items to each of which a plurality of categories have been applied. 

21. A method according to claim 20, wherein provision is made to control the 
maximum number of categories to be displayed. 

10 22. A method according to any of claims 15 to 21, wherein a selected element may be 
removed from one display area when added to another. 

23. A method according to any of claims 15 to 22 wherein an element selected from 
one display area may remain there, and a copy placed in the additional area. 

15 

24. A method according to any preceding claim, wherein the number of categories 
associated with one or more media objects is indicated 

25. A method according to claim 24, wherein media objects to which no categories 
20 have been applied are identified 

26. A method according to any of claims 15 to 25, wherein a subset of the media 
objects may be selected for allocation to a given category. 

25 27. A method according to any of claims 15 to 26, wherein the size of the display area 
allocated to each category is proportional to the number of media objects portrayed 
therein. 

28 A computer program or suite of computer programs for use with one or more 
30 computers to provide the method of any one of claims 1 5 to 27. 
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ABSTRACT 

Data Handling System 




5 Media objects such as film clips that have been stored for subsequent retrieval are 
represented in a display. In order to apply a classification or indexing scheme to the 
objects, a user selects an individual media object from the display, and allocates the 
selected media object to a designated category by placing a representation of the 
selected media object in a region of the display representing the designated category, 
10 whereby data relating to the selected media object is stored associating it with the 
category so identified. 
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